Gaussianizing the non-Gaussian lensing convergence field I: the performance of the 

Gaussianization 
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Motivated by recent works of Neyrinck et al. 2009 and Scherrer et al. 2010, we proposed a 
Gaussian transformation to Gaussianize the non-Gaussian lensing convergence field k. It performs 
a local monotonic transformation k — >■ y pixel by pixel to make the fine-scale one-point probabil- 
ity distribution function of the new variable y Gaussian. We tested whether the whole y field is 
Gaussian through N-body simulations. (1) We found that the proposed Gaussianization suppresses 
the non-Gaussianity by orders of magnitude, in measures of the skewness, the kurtosis, the 5th- 
and 6th-order cumulants of the y field smoothed over various angular scales, relative to that of 
the corresponding smoothed k field. The residual non-Gaussianities are often consistent with zero 
within the statistical errors. (2) The Gaussianization significantly suppresses the bispectrum. Fur- 
thermore, the residual scatters about zero, depending on the configuration in the Fourier space. (3) 
The Gaussianization works with even better performance for the 2D fields of the matter density 
projected over ~ 300/i~'^Mpc distance interval centered at 2 £ (0,2), which can be reconstructed 
from the weak lensing tomography. (4) We identified imperfectness and complexities of the proposed 
Gaussianization. We noticed weak residual non-Gaussianity in the y field. We verified the widely 
used logarithmic transformation as a good approximation to the Gaussian transformation. However, 
we also found noticeable deviations. 

PACS numbers: 98.80.-k; 98.65.Dx; 98.62.Ve; 98.62.Sb 



I. INTRODUCTION 



Weak gravitational lensing has been established as one 
of the most powerful probes of the dark miiverse 
Meanwhile, it is facing many challenges. One of them is 
the nonlinear evolution of the large scale structure (LSS). 
An inevitable consequence is the significant presence of 
non-Gaussianity in the weak lensing field at subdegree 
scales, which has been confirmed by observations Q. For 
this reason, the most widely used statistics, the power 
spectrum, fails to capture all the cosmological informa- 
tion and a significant amount of information is encoded 
in the three point lensing statistics and perhaps also 
in even higher-order statistics. It also increases cosmic 
variance in the power spectrum, introduces correlated er- 
rors between different modes and complicates the error 
analysis based on the power spectrum measurement 

A number of recent works on the 3D matter den- 
sity field lead us to propose a possible solution to the 
lensing non-Gaussianity problem. Neyrinck et al. 2009 
[13} showed that a local logarithmic transformation 6 — 
ln(l -I- 5) is able to significantly improve the cosmolog- 
ical information encoded in the 3D power spectrum of 
the new field. Since the transformation is nonlinear, it 
compresses information encoded in higher-order statis- 
tics of the original density field into the power spectrum 
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of the new field. This significant improvement in the cos- 
mological information also implies that the new field is 
significantly more Gaussian. 

Scherrer et al. 2010 [11] introduced a novel and pow- 
erful mathematical tool, the copula, to study the statis- 
tics of the LSS. The n-point copula is the function that 
relates the one-point CDF (cumulative distribution func- 
tion) to the joint n-point CDF. A mathematical theorem 
states that the copula always exist and must be unique 
(refer to [3] and references therein for details). Hence 
the one-point PDF (probability distribution function) 
and the full n-point copula {n = 2, 3 • • •) present a com- 
plete description of a field. A surprising finding by 
through N-body simulation is that, despite strong non- 
Gaussianity in the 3D matter distribution, the two-point 
copula is virtually Gaussian. This motivates the authors 
to postulate the Gaussian copula hypothesis, that all n- 
point copula functions are Gaussian. 

Copula is invariant under monotonic transformation. 
Hence the finding above implies that, if we find a local 
monotonic transformation to make the one-point PDF 
Gaussian, other statistics (n-point PDFs and equiva- 
lently n-point correlation functions) will be Gaussian. 
The whole field will then be Gaussian and hence cos- 
mological information otherwise encoded in higher-order 
statistics is now compressed into the power spectrum. 
This helps to understand the enhancement of cosmolog- 
ical information in the power spectrum by performing a 
logarithmic transformation ( ^13j | and also in their more 
recent work [3]). Since the one-point PDF of the 3D 
density distribution is know to be close to log-normal. 
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the logarithmic transformation makes the one-point PDF 
nearly Gaussian. Along with the nearly Gaussian copula 
as found by [3], the logarithmic transformation should 
indeed work well to Gaussianize the 3D density field and 
compress cosmological information otherwise encoded in 
higher-order statistics into the power spectrum [l6j . 

These studies focus on the 3D matter distribution. 
Nevertheless, this naturally motivates us to propose the 
following postulation on the 2D lensing field. Without 
loss of generality, the lensing field is described by the 
lensing convergence k throughout the paper. Our postu- 
lation, which is a natural and straightforward extension 
of the Gaussian copula hypothesis proposed in [l^ , then 
states that. 



• Local monotonic Gaussian transformation k - 
makes the whole y field (effectively) Gaussian. 



y 



Here, local monotonic Gaussian transformations are the 
transformations k ^ y that make the one-point PDF of 
the pixelized field Gaussian with rank order preserved. 
The grid size for pixelization should be suitable to extract 
cosmological information. The scale should be enough to 
resolve nonlinear regime, but meanwhile the shot noise 
should not dominate the statistics. We call the field 
binned into a uniform grid suitable to extract cosmo- 
logical information as fine-scale field. A field being (ef- 
fectively) Gaussian means that all the measures of non- 
Gaussianity are (significantly suppressed and) consistent 
with zero within the error bars, under the consideration 
on the existence of fluctuations in statistics due to a fi- 
nite volume. If this postulation is indeed correct, all cos- 
mological information is encoded in the transformation 
K — )• y and the power spectrum of y. This would signifi- 
cantly simplify extracting cosmological parameters from 
the lensing field. 

We tested this postulation using our numerical sim- 
ulations. The measures of non-Gaussianity that we in- 
vestigate include the cumulants up to 6th-order of the 
smoothed y field and the bispectrum of the fine-scale 
y field. By definition, the fine-scale y field derived by 
pixel-to-pixel mapping has Gaussian cumulants due to 
the forced Gaussianization. However, this does not ap- 
ply to the smoothed one, since by smoothing over several 
pixel scale, pixel-pixel correlation comes into the cumu- 
lants. We found that, for all these measures, the non- 
Gaussianity is suppressed by orders of magnitude and 
the residual non-Gaussianity is often negligible. From 
a practical viewpoint, these results, though limited to 
statistics no higher than 6th-order, prove the above pos- 
tulation. Hence it shows that the non-Gaussian lensing 
field can be indeed Gaussianized by a simple local vari- 
able transformation. Although all these tests are carried 
out against the 2D convergence field, it strongly supports 
the original Gaussian copula hypothesis on the 3D matter 
distribution 14 1. 

Our Gaussianization proposal is closely related to, 
but not the same as, the previous Gaussianization pro- 
posed to recover the primordial density distribution (e.g. 



[ITI I18|). Such proposal assumes an one-to-one mono- 
tonic correspondence between the observed density (mat- 
ter, galaxy, Lyman-a fiux, etc.) distribution and the pri- 
mordial one. Our Gaussianization relies on the empirical 
finding of (14] . However, it does not rely on the above 
assumption and can work surprisingly well even in the 
deeply nonlinear regime where this assumption breaks. 
But for the same reason, the Gaussianized field is not 
necessarily closely related to the primordial density dis- 
tribution. 



Our work complements several recent works, but 
through different and independent approaches. (1) fT§| 
found that a logarithmic transformation k — > hi{l+K/ kq) 
can significantly increase the information contained in 
the lensing power spectrum while reducing correlations 
between different multipole £ modes. This work focused 
on the two-point statistics (and its variance). Our work 
focuses on the direct measures of the non-Gaussianity, 
such as the skewness and the bispectrum. Both works are 
hence highly complementary and consistent, both sup- 
porting the feasibility of Gaussianizing the non-Gaussian 
lensing field. (2) j20l ? ] took a different approach to 
Gaussianize the 3D matter density field and the 2D lens- 
ing convergence field. By using nonlinear wavelet Weiner 
filtering, they are able to remove/suppress small scale 
non-Gaussian structure, rendering the fields more Gaus- 
sian and hence increasing the information contained in 
the 2D and 3D power spectra. (3) [2l| also investigated 
the information on cosmology contained in Gaussianized 
weak gravitational lensing convergence fields, when our 
work is under modification to publish. By employing 
Box-Cox transformations to determine optimal transfor- 
mations to Gaussianity, they developed analytical mod- 
els for the transformed power spectrum, including ef- 
fects of noise and smoothing. They found that optimized 
Box-Cox transformations perform better than logarith- 
mic transformation, but both yield very similar results 
for the signal-to-noise and parameter constraints in ideal 
case. When adding a realistic level of shape noise, all 
transformations perform poorly. 



Our paper is organized as follows: In ^jH] we briefly 
introduce the weak lensing basics and propose the Gaus- 
sianization method. Measures of the non-Gaussianity 
and hence measures of the Gaussianization performance 
are also introduced in this section. The simulation used 
to construct weak lensing convergence maps is described 
in mill The measures of the non-Gaussianity in sim- 
ulated lensing convergence maps are presented in WVl 
In S|V]we repeat the analysis for the 2D matter density 
field projected over the box size, which corresponds to 
the redshift resolved matter field reconstructed from the 
weak lensing tomography. In WII we outline key issues 
for further investigation. 
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II. GAUSSIANIZING THE LENSING 
CONVERGENCE FIELD 

The weak lensing basics can be found in review arti- 
cles and a textbook j22]- Under the Born approx- 
imation, the weak lensing convergence k of a source at 
redshift and direction n can be expressed as 0] 



/t(n, Zs) = / 5ni{n,z)W{z,Zs)dx ■ 



(1) 



Here, 5m is the matter overdensity. x = x{^) is the co- 
moving angular diameter distance to the lens redshift z. 
We conveniently express x iii units of the Hubble radius, 
X = x/{c/Ho), in which Hq is the Hubble constant to- 
day. The lensing kernel W{z, Zg) for a source at redshift 
Zg and a lens at redshift z is given by 



Wiz,zg) = + z)xiz) 



X{z) 



(2) 



when z < Zg and zero otherwise. Here fi™ is the cos- 
mological matter density of the universe in units of the 
critical density. The above expression is valid for the flat 
cosmology we consider throughout the paper. 

Weak lensing directly probes the matter overdensity 
projected along the line-of-sight from the observer to the 
source galaxies. Through the Zg dependence in k, the 
lensing tomography works to reconstruct the 3D mat- 
ter distribution, namely 5m{n, z). In reality, due to lim- 
ited source redshifts, measurement errors and the rela- 
tively wide and smooth lensing kernel, the reconstruc- 
tion is most feasible for the binned matter overdensity 
(5^, namely, the matter column density averaged over a 
narrow redshift bin. The one for the i-th redshift bin 
[zi — Azi/2 < z < Zi + Azi/2) is defined as 



Jzi-tzi/l^rn{n,z)dx 



(3) 



Here A^i is the comoving width of the i-th redshift bin. 



AXi = x[zi + 



Az 



X z 



Az, 



Eq. [T] can then be rewritten as 



K{h) 



6f{n)W,Ax^ 



(4) 



(5) 



where the weight Wi = W{zi, Zg). Eq. [5] is exact in the 
limit Axi — >■ such that we can neglect the variation 
of W over the given redshift bin. In reality, since the 
typical width of the lensing kernel W is comparable to 
the Hubble radius, Eq. [5] is an excellent approximation 
as long as Axi <C c/Hq = 3/i~^Gpc. 

Both the K(n) and 6f{n) fields are, in principle, ob- 
servable. Sf is a redshift resolved version of k. It con- 
tains more detailed information on the LSS evolution and 



hence the dark universe. For subdegree angular scales of 
interest and O(100/i~^Mpc) bin width, 5f each are effec- 
tively uncorrelated with each other. Understanding the 
statistics of Sf then allows us to predict the statistics of 
K through Eq. [5] in a straightforward fashion. 

Both K(n) and S'^{fi) are nonlinear and non-Gaussian, 
meaning that the two-point angular correlation function 
(and equivalently the power spectrum) fails to capture 
all cosmological information encoded in these fields. A 
straightforward remedy for such information loss is to 
go up through the n-point correlation hierarchy. How- 
ever calculating three-point and higher-order statistics 
are daunting for large data sets. Even worse, it is not 
clear where the information saturates. The Gaussianiza- 
tion on k and Sf provides an alternative. If succeed, it 
will truncate the analysis at the level of the two-point 
statistics and hence significantly simplify the analysis. 



A. The Gaussianization procedure 

We consider an idealized case that the lensing data is 
binned into a uniform grid suitable to extract cosmolog- 
ical information. For each pixel centered at direction n, 
we measure (and Sf{f±), where f± is the coordinate 
perpendicular to the line-of-sight). We call it a fine-scale 
lensing convergence (and S^) map. From such data we 
can directly measure the one-point PDF of k and S^. 
These PDFs are non-Gaussian, for subdegree pixel size. 

One can always find a set of monotonic local (pixel 
by pixel) transformations k — > y ((5^ — > y) such that 
the one-point PDF of the fine-scale y field is Gaussian. 
The numerical solution can be found by the following 
equations. 



PG{y)dy 



and 



P{K)dK , 



r PGiy)dy= [ P(<5^)d<5^ 

J — oo J —oo 



(6) 



(7) 



Here by definition Paiy) — exp{—y'^ /2ay) / y/znay is the 
Gaussian PDF and ay is the rms dispersion of y. 

Throughout the paper, we call the solution to the 
above equations as the Gaussian transformation. It is 
the transformation that we propose to Gaussianize the 
whole lensing convergence field. Such transformations 
are unique, up to a scaling y ay, where a oc ay re- 
flects the freedom in choosing ay. Such scaling does not 
affect the non-Gaussianity of the y maps. By choosing 
a suitable scaling, the y field can have other desirable 
properties. We will briefly discuss this issue later in 



B. Measures of the Gaussianization performance 

The performance of the proposed Gaussianization pro- 
cedure can be quantifled by various non-Gaussianity mea- 
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sures. We have investigated the cumulants up to the 6th 
order and the reduced bispectrum. 

(1) The rtth-order cumulants of the smoothed y field, 
denoted as K„{ys)- These are nontrivial checks, despite 
the fact that nth-order cumulants of the fine-scale y field 
are Gaussian by definition. Smoothing introduces pixel- 
pixel correlation and can give rise to non- Gaussian ys 
cumulants. We adopt the conventional definition of the 
normalized cumulants in statistics. 



K3 



(y|)3/2 
(y|)5/2 
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(2/1) 



(y|)3/2 ' 



(8) 
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The cumulants before the Gaussianization is defined by 
replacing ys with ks (and 5'g) smoothed in the same way. 

We caution that Kn differ from the cumulants «S*7i usu- 
ally adopted in LSS study (e.g. in the review article [23jV 
For example, = / There are two major rea- 
sons to define the cumulants Kn according to Eq. [H (i) 
Sn are not invariant under the transformation y ^ ay, 
while the non-Gaussianity is obviously invariant under 
this transformation. Since we will quantify the Gaussian- 
ization performance by comparing the cumulants prior 
to and posterior to the Gaussianization, the usual defini- 
tions Sn could mislead us. On the contrary, Kn defined 
above are invariant under the transformation y ^ ay and 
hence allows for unbiased comparison, (ii) Kn itself is a 
straightforward measure of the Gaussianization perfor- 
mance, since \Kn \ ^ 1 means weak non-Gaussianity. On 
the contrary, Sn does not have this property, again due 
to its dependence on the scaling y ^ ay. 

(2) The redefined reduced bispectrum q. For the 2D 
(5^ fields, we define 



(j(A;_L,l,/cj_,2,fc_L,3) 



[P{k^,i)P{k^^2)+cyc.] 



3/4 



(9) 



Here, k±^i {i = 1,2,3) are 2D wavevectors and 
Z)i^-L,* = 0- The bispectrum -B(A:±,ii ^±,2, ^±,3) = 
{S^{k±^i)S^{k±^2)S^{k±,3)) and the power spectrum 
P{k±) = {\5^{k±)\'^). For the lensing convergence maps, 
we replaced k±^i with the 2D multipole vector £. 

The q that we define has a power index 3 /4 in the de- 
nominator, which differs from the power index 1 in the 
capital Q widely used in the LSS literature. Although 
the power index 1 is the natural choice preferred by the 
2nd-order perturbation theory, it causes Q to vary under 
the transformation y — > ay. For this reason, Q is not suit- 
able to describe the Gaussianization performance. Con- 
versely, q is invariant under the transformation y ^ ay. 



Unless otherwise specified, we refer to q as the reduced 
bispectrum throughout the paper. 

Here we remind readers that the Gaussian transfor- 
mation does not reduce higher-order statistics in all sit- 
uations. Gaussianization has no effect on a field only 
containing O's and I's, which arises observationally if too 
small a cell size is used for galaxy number counting field. 
Only the field with all nth order copula being Gaussian 
can be fully Gaussianized with this Gaussianization pro- 
cedure. 



III. SIMULATED LENSING MAPS 

We numerically evaluate Kn and q and test the Gaus- 
sianization performance using simulated lensing maps. 
Our N-body simulations were run using the Gadget-2 
code |24| . They all have box size L = SOO/i-^Mpc and 
adopt the standard ACDM cosmology, with ilm = 0.266, 
17a = 1 - ^m, (T8 = 0.801, h = 0.71 and = 0.963. 
More details can be found in 



25[. The maps are gener- 



ated as follows. (1) For each output, we project along x, 
y and z directions respectively to generate three maps of 
6f{f±), where is the 2D coordinate in comoving dis- 
tance. (2) We then convert rj_ to the angular coordinate 
through h — r_L/xizi), where Zi is the redshift of the cor- 
responding output. (3) We have chosen the output red- 
shifts such that any two adjacent outputs are separated 
by the box size L = 300/i~^Mpc in comoving distance. 
So by stacking maps of 6f according to Eq. [5l we obtain 
maps of lensing convergence. Since each snapshot is ob- 
tained from the same initial conditions, to avoid artificial 
correlation, we randomly shift and rotate the snapshots 
utilizing the periodical boundary condition. 

We stack eight snapshots between z = and z = 1 
to make k maps. These lensing maps correspond to 
source redshift z w 1.02 and comoving source distance 
2400/i^iMpc. The map size is 7.64° x 7.64°. The maps 
all have 512^ uniform grids, corresponding to angular res- 
olution 0.9 . It is enough to resolve nonlinear regime. 

This map making algorithm is valid under the Born 
approximation. Higher-order lensing corrections (26j can 
be neglected in the present theoretical study. Nonethe- 
less, the process is still highly simplified in the sense that 
it ignores many complexities in real observations, such 
as the random shape error in cosmic shear measurement, 
irregular masks and nonuniform survey depth. We will 
briefly discuss the possible impact of these complexities 
later in the paper, but postpone quantitative study else- 
where. 



IV. GAUSSIANIZATION OF THE k FIELD 

It is well known that the lensing convergence field k is 
non-Gaussian at subdegree scale. We show one such non- 
Gaussian PDF of a fine-scale k map we constructed in 
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FIG. 1: The PDF of one k field (solid line) generated from 
our simulation. The source redshift zs = 1.02 and the pixel 
size is 0.9 . a is the rms dispersion in k. The dotted line is 
the Gaussian PDF. Nonlinear evolution pushes the k PDF to 
be skewed toward negative value of k, while develops a tail of 
large positive k. 
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FIG. 2: The ac-j/ relations for 20 different line-of-sight re- 
alizations (solid line). The Gaussian transformations show 
reasonably good agreement. Irregularities at large k are 
largely caused by rare massive halos in the line-of-sight. 
The dotted line is a fitting of generalized logarithmic form, 
y = aln(l -I- k/&) -|- c, with a and b, c = y(K — 0) as fitting 
parameters. We confirm that the logarithmic transformation 
is indeed a reasonably good approximation to the true Gaus- 
sianization transformation. However, it is not surprising for 
the departures between them. 

Fig|T] Nonlinear evolution generates and amplifies non- 
Gaussianity in the matter distribution. It drives P{k) to 
be skewed and shifts the peak to k < 0. 



A. The K-y relation 

The Gaussian transformations k ^ y defined by Eq. 
ini are shown in Fig. [21 for the 20 maps of k generated 
from our simulation. The 20 curves for each realization 



(map) are in reasonable agreement with each other, im- 
plying that the transformation is stable and insusceptible 
to field to field fluctuations [l^] • To fix the normalization, 
we require 



K — 

by numerically estimating the slope at zero point of the 
standard Gaussian transformation function (which make 
y fields be standard Gaussian distributed with unit devi- 
ation), and scaling the slope to unity. This condition 
makes the power spectrum at sufficiently linear scale 
more or less invariant under the transformation. This 
condition is also satisfied by the usual logarithmic trans- 
formation S ln(l + 5) (e.g. in [13, U^)- Nevertheless, 
our choice of the normalization is rather arbitrary. Later 
in ijVII we will discuss whether we can use this normaliza- 
tion freedom to not only Gaussianize but also linearize 
the matter density field. Many more simulations are re- 
quired to obtain the mean n-y relation, a key issue of 
further investigation. 

How well can the widely adopted logarithmic trans- 
formation describe the above n-y relation? Also plot- 
ted in Fig|2]is a fitting to a generalized logarithmic form 
y = aln(l + n/b) -f c with a, 6, c as fitting parameters. 
By definition, c = y{K = 0). So in the fitting we sim- 
ply fix c to be y{K = 0) averaged from 20 realizations. 
The best fitting we found is a = 0.0145, b = 0.0526 
and c — 0.00294. This fitting is reasonably good, con- 
firming the usefulness of the logarithmic transformation. 
However, Figj^] does show clear deviations from even a 
general logarithmic transformation defined above. It is 
not surprising that the fields are not exactly lognormally 
distributed, since the departures from lognormality have 
already been found by several works. 

The Gaussian transformation suppresses k of high pos- 
itive values relative to the one of lower values (Fig. [2]). 
Since these high n are largely responsible for the non- 
Gaussianity, we would expect the above transformation 
will suppress the overall non-Gaussianity to some extent. 

B. The cumulants 

The skewness of the smoothed k fields and y fields, 
as a function of the smoothing radius ^g, are shown 
in Fig. [3l We adopt a Gaussian smoothing function, 
with 6*5 e [0.5 ,9 ]. We somewhat arbitrarily choose the 
largest smoothing scale as 10 pixel size. Nevertheless, 
such scale is sufficient to take the pixel-pixel correlation 
into account and study the Gaussianization performance. 
The Gaussianization procedure suppresses the skewness 
by at least an order of magnitude for all smoothing radius 
investigated. The residual skewness at all 9s is consis- 
tent with zero within three times the simulation error 
bars. The performance at 0s ~^ is trivial, other than 
verifying the accuracy of the solution to Eq. [51 How- 
ever, the performance at larger ^5 is indeed surprising. 
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FIG. 3: The skewness K-i of the k fields (sohd Une) and the 
y fields (dotted line) smoothed with Gaussian filters of radius 
9s- The lines are averaged over 20 realizations and error bars 
drawn are the rms dispersions over the 20 realizations. The 
cumulants Kn (n = 3, • ■ •) are defined by Eq. [8]). They have 
different normalizations from that of the cumulants S„ widely 
adopted in the LSS literature. Each Kn is invariant under the 
scaling y ^ ay and hence avoid an ambiguity in quantifying 
the non-Gaussianity. 
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FIG. 4: Same as Fig. O but for the kurtosis K4. The Gaus- 
sianization suppresses K4 dramatically, so if 4(1/) is barely rec- 
ognizable in this plot. Refer to Fig. [7] for a clearer view. We 
caution the readers on the definition of Kn (Eq. [HI). 



The largest smoothing scale investigated is 6*5 9 , cor- 
responding to 10 pixel size. Smoothing over this radius 
includes complicated nonlocal correlation between pixels, 
whose non-Gaussianity is not guaranteed to be removed 
or reduced by the local Gaussianization procedure. Ex- 
actly what kind of feature in the LSS non-Gaussianity is 
responsible for the surprisingly good performance of the 
Gaussianization at large smoothing radius is an interest- 
ing topic in itself and worth investigating. 

Nevertheless, we find that the Gaussianization does not 
perfectly produce a Gaussian random field. For example, 
K3 at ^5 ~ 5 shows weak deviation ('^ 2a) from zero, 
implying that the non-Gaussianity in the nonlocal pixel 



FIG. 5: Same as Fig. O but for the 5th-order cumulants 
K5. K5 is suppressed by the Gaussianization by two orders 
of magnitude to effectively zero. Refer to Fig. [7] for a clearer 
view. We caution the readers on the definition of Kn (Eq. [8|. 
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FIG. 6: Same as Fig. [3l but for the 6th-order cumulants K^. 
We caution the readers on the definition of Kn (Eq. [S]). 



correlation is not completely removed by the Gaussian- 
ization. The nonlocal pixel correlation is better described 
by the bispectrum, in which we find more robust evidence 
on the residual non- Gaussianity ( CVCl) . 

The Gaussianization also suppresses the kurtosis, the 
5th and 6th order cumulants dramatically. The residuals 
nearly vanish for all smoothing radius 9s < 10 investi- 
gated (Fig. |4l[5]&|6l). Fig. [7] shows explicitly the sup- 
pression factors in cumulants up to the 6th order. For 
all these quantities at all smoothing radius, they are sup- 
pressed by at least an order of magnitude. All these re- 
sults demonstrate the excellent performance of the Gaus- 
sianization procedure. These results also provide strong 
and independent support of the Gaussian copula hypoth- 
esis in I14II. 
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FIG. 7: The efficiency of the Gaussianization, in terms of the 
ratio of high order cumulants before and after the Gaussian 
transformation. The solid line is for the skewness, the dotted 
line for the kurtosis, the dash line for the 5th-order cumulant, 
and the dotted dash line for the 6th-order cumulant. For all 
investigated scales, the cumulants are suppressed by at least 
a factor of ~ 20. Kn, defined through Eq. [S] is invariant 
under the transformation y — >■ ay, where a is an arbitrary 
constant. This invariance makes the ratio of the cumulants 
before and after the Gaussianization suitable for quantifying 
the Gaussianization performance. 



C. The bispectrum 

Another measure of non-Gaussianity is bispectrum 
Bihjhjh), which vanishes for Gaussian fields. The bis- 
pectrum in LSS study is often expressed in a reduced bis- 
pectrum Q{li, h, h) for convenience. However, Q defined 
in the conventional way is not invariant under the scaling 
y ^ ay whereas the non-Gaussianity should be. Hence, 
following the same argument in defining the cumulants, 
we define a reduced bispectrum q according to Eq. ^ 
which is invariant under the transformation y ay. 
We express q in the coordinate {l,u,a), where / = Zi, 
u = h/h and a = arccos(/i • l2)/hl2- 

The reduced bispectrum for various configurations are 
presented in FiglH where the solid lines are for the k fields 
and dotted are for the corresponding y fields. The curves 
are averaged over the 20 realizations and the error bars 
are the rms dispersions within the 20 realizations. We 
show 9 values of / e [200, 6000), which covers most range 
of interest for weak lensing cosmology. For most config- 
urations of {l,u,a), q is significantly suppressed by the 
Gaussianization. In many configurations the q{y)s are 
consistent with zero. Since the bispectrum is a measure 
of nonlocal pixel correlation, these results show effective- 
ness of the Gaussianization. This conclusion is consistent 
with the findings in previous section on the skewness and 
higher-order cumulants. 

The gs (and the bispectra) of the y field scatter about 
zero. This behavior is in sharp contrast to the qs of the n 
field, which is always positive, a characteristic behavior 



of the large scale structure. The skewness is an aver- 
age of the bispectrum over all configurations (although 
with different weighting). Positive and negative values 
of the bispectrum of different configurations largely can- 
cel in the average. This explains the larger suppression in 
the skewness (FigjT]) than in the bispectrum of individual 
configuration. 

Since q describes the non-Gaussianity of individual 
configuration, to some extent it is a more sensitive mea- 
sure of the performance of the Gaussianization. We find 
robust evidence for residual non-Gaussianity in some con- 
figurations of the bispectrum (e.g. some configurations 
with large a in Fig. [S]). This means that the non- 
Gaussian and nonlocal pixel correlation is not completely 
removed by the Gaussianization. It also confirms the 
residual nonlocal pixel correlation as the source of resid- 
ual non-Gaussianity. A direct implication is that the 
weak lensing convergence copula is not exactly Gaussian, 
otherwise the bispectrum of the y field would vanish. An- 
other implication is that perfect local Gaussianization 
transformation does not exist. 



V. GAUSSIANIZATION OF THE FIELDS 

Weak lensing tomography allows us to reconstruct the 
matter distribution as a function of redshift and hence 
infers the LSS evolution [2^. For this reason, we inves- 
tigate the Gaussianization of the field, the 2D matter 
density distribution projected over a narrow redshift bin. 

We generate maps on a uniform 512^ grid at various 
redshifts. The pixel size of the 6^ maps is then L/512 = 
O.Gft-^^Mpc. The one-point PDFs of the fine-scale 5^ 
maps are shown in Fig. [S] Clearly, nonlinear evolution 
generates and amplifies non-Gaussianity in the matter 
distribution. It drives P{S^) to be skewed and shifts the 
peak to < 0. This is most obvious at z < 2, the most 
relevant redshift range for weak lensing tomography. 



A. The relation 

We will basically reapply the analysis done to the k 
field to the 5^ fields. Again, we fix the normalization by 
setting 



dy 



= 1 



(11) 



5^=0 



This condition is designed to keep the power spectrum at 
sufficiently linear scale invariant under the transforma- 
tion 5^ —i' y, a. desirable property to study the structure 
evolution. 

The Gaussian transformations 6^-y a,t z — 0,1,2,3.5 
are shown in Fig. [TOl To check the stability of the trans- 
formation, we plot the S^-y relations on the three inde- 
pendent projections of the same output. FigllOl shows 
that the three curves for each output overlap with each 
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FIG. 8: The redefined reduced bispectrum qs before (solid lines) and after (dotted lines) the Gaussianization. The Gaussianiza- 
tion reduces the qs for almost all configurations and reduce many of them to be consistent with zero within la error. However, 
for some configurations the residual non-Gaussianity is still significant. 
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FIG. 9: The PDF of 5^ field at « = 0, 1, 2, 3.5. a is the corre- 
sponding rms dispersion per pixel. The peat height increases 
with decreasing redshift. 



other and are virtually indistinguishable. This conver- 
gence implies that the transformation is stable and in- 
sensitive to field to field fluctuations. 

As expected, the Gaussian transformation suppresses 



5^ of large amplitude relatively to 5^ of small ampli- 
tude. This is also what we would expect for the loga- 
rithmic transformation 5^ — ^ ln(l + 5^). However, closer 
look shows that it diflFers from the logarithmic transfor- 
mation. (1) We have checked that ln(l -|- 6^)-y is not a 
straight line, with noticeable departures from the loga- 
rithmic transformation. (2) The logarithmic transforma- 
tion and its generalization 5^ — > ln(l + /b) contains 
a fixed point (0,0). Here b is some constant. But the 
actual Gaussian transformation does not have this fixed 
point. The point y{S^ = 0) shifts toward larger posi- 
tive value and the point 6^{y = 0) shifts toward larger 
negative value, for decreasing redshift. This behavior is 
consistent with the picture of nonlinear evolution. At 
sufficiently high redshifts, underdense {5^ < 0) regions 
occupy half the cosmic volume. Towards lower redshifts, 
the volume occupied by underdense regions (voids) in- 
creases. This causes y{5^ = 0) to become increasingly 
positive. Meanwhile, in the voids evolves to more 
negative value. Since 6^{y = 0) is the threshold that 
voids with lower density occupy half the cosmic volume, 
it becomes increasingly negative with decreasing redshift. 
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FIG. 10: Panel (a) shows the Gaussianization relation 
From up to bottom, the lines correspond to 2; = 0,1,2,3.5. 
For each redshift, we plot the transformation functions ob- 
tained from 3 independent Cartesian directions, which con- 
verge well and are barely distinguishable. Panel (b) is zoom- 
in version near 5^ = 0. Notice the shift of y{5^ = 0) and 
S^{y = 0) with respect to the redshift. 



FIG. 11: Panel (a) shows the skewness of 5^ field and of y 
field depending on smoothing radius on averaging results of 
three directions of simulation box. Solid lines for z = and 
dotted ones for z = 1. Errors are calculated through three 
different directions. Panel (b) only shows the results of the y 
field such that we can see the residual non-Gaussianity clearly. 



B. The cumulants of the smoothed y{5^) fields 

We adopt the Gaussian window function with ra- 
dius Rs to smooth the S'^ field. The adopted Rs £ 
(0, 6)/i~^Mpc covers most nonlinear scale of interest. We 
show the results at two typical redshifts z = 0, 1 in Fig. 
[m m [H & [H We find that the cumulants up to 6th 
order are all significantly suppressed with respect to the 
corresponding values before the Gaussianization. We re- 
mind the readers again that these cumulants are defined 
in the usual language of statistics (Eq. and differ from 
those in LSS studies. 

FiglH shows the skewness before and after the 
Gaussianization. The error bars are estimated from the 
three directions. For all the redshifts and smoothing ra- 
dius Rs that we investigate, we find at least an order 
of magnitude or more suppression in K^^ meaning that 
the non-Gaussianity is significantly suppressed. Further- 



more, after the Gaussianization is now \K^\ <^ 1, 
meaning that the field is indeed nearly Gaussian for the 
measure of K^, . 

Despite this excellent performance of Gaussianization, 
it is not perfect. The panel (b) of Fig [TT] shows small, but 
robust departure from K3 = at large smoothing radius, 
meaning a residual non-Gaussianity in the y field. Again, 
like the case of the convergence field, large smooth- 
ing radius means the inclusion of more and more non- 
Gaussian pixel-pixel correlations, which are not neces- 
sarily removed by the local Gaussianization process. And 
again, through the bispectrum measurement in W G[ we 
show more robustly the residual non-Gaussianity in the 
pixel-pixel correlation. Nevertheless, the residual skew- 
ness is small. So the Gaussianization procedure works 
effectively. 

Fig. [T2] shows the results of the kurtosis K4 and Fig. 
[T51 shows the results of and Kg. Fig. [T3] shows 
the ratio of Kn {n = 3,4,5,6) posterior to and prior 
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FIG. 12: Similar to Fig. Illl but for K4. The Gaussianization 
significantly suppresses K4, so the data points of K4{y) are 
not recognizable in these plots. Refer to Fig ll4l for a clearer 
view. 
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FIG. 13: Similar to Fig. [Til but for JsTg.e. The Gaussian- 
ization significantly suppresses /Cs.e, so the data points of 
Ksfiiu) are not recognizable in these plots. Refer to Fig |14l 
for a clearer view. 



to Gaussianization. The Gaussianization suppresses Kn 
{n = 3,4,5,6) by more than 1 order of magnitude (Fig. 
[T?)l . Furthermore, we have \Kn\ ^ 1, meaning that the 
residual non-Gaussianity is indeed small and the y fields 
are effectively Gaussian. 



C. The bispectrum of the 3^ field 

The reduced bispectrum g at 2; = is shown in Fig. 
Uni Though z = is not the optimal redshift for the 
weak lensing tomography. We choose this particular red- 
shift because the non-Gaussianity is strongest in the 
field at 2 = but weakest in the k field. By testing the 
Gaussianization performance for the two extreme cases, 
we can have better understanding on the generality of 
the Gaussianization procedure. 

Again, the results are similar to the case of the con- 
vergence field. Overall, the Gaussianization pushes q to- 
ward zero for all configurations, a clear demonstration 
that Gaussianization works. Furthermore, the bispec- 
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FIG. 14: The ratio of cumulants before and after the trans- 
formation as a function of smoothing scale at 2 = 0. Solid 
lines are for the skewness, dotted lines for the kurtosis, dash 
lines for and dotted dash lines for Kg. The Gaussianiza- 
tion usually suppresses these non-Gaussian measures by two 
orders of magnitude or more. 
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FIG. 15: The redefined reduced bispectrum g of 5^ at z = 0, before (solid lines) and after (dotted lines) the Gaussianization. 
k is in unit of /i/Mpc. Notice that q differs from the widely adopted reduced bispectrum Q. The Gaussianization suppresses 
the non-Gaussianity to effectively zero for almost all configurations. However, we do find significant residual non-Gaussianity 
for some configurations, showing the imperfectness of the Gaussianization procedure. 



trum prior to Gaussianization is positive, whereas after- 
wards it is scattered about zero. Since the skewness is a 
linear integral of the bispectrum, positive and negative 
bispectrum in the y field partly cancel each other and 
result in larger suppression in the skewness, as shown in 

Fig. [ni&iii 

The suppression factor in q depends on the strength 
of nonlinearity in the corresponding configuration. The 
suppression factor is in general small, for small k and 
large a, which corresponds to weaker nonlinearity. On 
the other hand, for larger k and smaller a corresponding 
to stronger nonlinearity, q can be suppressed by orders of 
magnitude. This behavior is consistent with the suppres- 
sion in skewness, which is smaller for larger smoothing 
radii. 



VI. DISCUSSIONS AND CONCLUSIONS 

We have demonstrated that it is indeed possible to 
Gaussianize the lensing convergence k field and the un- 
derlying matter density field beyond one-point PDF, 
through a simple local transformation which is defined 



only to Gaussianize the one-point PDF. (1) The Gaus- 
sianization procedure utilizes a local monotonic transfor- 
mation K — >■ y (or — >■ y) to make one-point PDF of 
K (or S^) Gaussian. This is a well defined mathematical 
problem and we have numerically shown that the solu- 
tion is stable. (2) We quantified the performance of the 
Gaussianization beyond one-point PDF against various 
measures of the non-Gaussianity, such as the skewness, 
kurtosis, 5tli- and 6th-order cumulants of the smoothed 
fields, and the bispectrum. We found that the Gaussian- 
ization works surprisingly well. It suppresses the above 
non-Gaussianity measures by orders of magnitude and ef- 
fectively reduce them to zero. This implies that in many 
exercises we can treat the resulting y field as Gaussian. 
The Gaussianization procedure then compresses cosmo- 
logical information in higher-order statistics of the origi- 
nal lensing fields into the power spectrum of y. For this 
reason, analyzing weak lensing statistics can be signifi- 
cantly simplified. 

Although we have tested the Gaussianization through 
many realizations of the k and 6^ fields, all of them are 
generated from a single simulation of finite box size and 
number of particles. So our findings are inevitably af- 
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fected by numerical issues such as cosmic variance, mass 
and force resolution, shot noise and aliasing effect [29j . 
We do not expect that any of these factors will alter 
the major conclusion on the effectiveness of the Gaus- 
sian transformation. Nevertheless, these numerical issues 
could in particular have larger impact on the residual 
non-Gaussianity, which itself is weak and relatively hard 
to measure accurately. We can imagine that these nu- 
merical issues introduce error in P{k). Such error then 
nonlinearly propagates into the k ^ y relation and may 
then cause non-negligible under- or overestimation of the 
residual non-Gaussianity in the y field |_30]. However, it is 
very unlikely that the detected residual non-Gaussianity 
can be canceled exactly. In this sense, the detection of 
the residual non-Gaussianity is robust, although we need 
many more simulations to measure the amplitude to high 
precision. 

There are many key issues to be explored in future 
study. An incomplete list includes the following. 

• The applicability to real data. So far we only con- 
sider a highly idealized case, which is suitable for 
the purpose of theoretical modeling of weak lens- 
ing. The same Gaussianization in principle should 
be also applicable to real data. But application is 
complicated by various measurement errors in real 
data. How well does the Gaussianization work with 
the presence of measurement errors is a crucial issue 
for further investigation. One of such complexities 
that we will consider is the random shape error. 
As long as the pixel size is sufficiently large, the 
central limit theorem drives its distribution to be 
Gaussian. Since the Gaussianization that we have 
proposed is nonlinear, it will render this Gaussian 
noise into a non-Gaussian one. Even worse, the 
same nonlinear transformation could mix the lens- 
ing signal and measurement noise. 

• The redshift evolution. We have shown the effec- 
tiveness of the Gaussianization to reduce the non- 
Gaussianity in the redshift resolved 6^ field. But 
it does not necessarily make the resulting y field 
evolve linearly. Further investigations are needed 
to see if a choice of suitable scaling y ^ ay will 
result J/ in a linearly evolving field. 

• Understanding the k ^ y relation. We need many 
more simulation realizations to robustly calculate 
and understand this relation and its dependence 
on cosmology, pixel size, redshift and other factors. 
It will be interesting to explore the cosmological 
information encoded in this relation. 

• The residual non-Gaussianity. The proposed Gaus- 



sianization works surprisingly well, but it does not 
perfectly produce a Gaussian random field. The 
residual non-Gaussianity we have detected is weak 
and is unlikely to carry a significant amount of cos- 
mological information. Nevertheless, it may be still 
worth investigating the cosmological information 
carried by the residual non-Gaussianities. 

2D projected matter distribution 6^ is the projection 
of 3D matter distribution along one direction, and weak 
lensing convergence field k is a summation of Sf with dif- 
ferent weight function. Since copula describe the struc- 
ture of the random field and how the one-point PDF join 
together to get n-point joint PDF, we hope that projec- 
tion and summation leaving copula Gaussian could be 
proofed. Since galaxy distribution is a biased tracer of 
matter distribution, it seems to keep rank order. Hence 
we hope that galaxy distribution also has Gaussian cop- 
ula. We expect that Copula will be a useful tool to ap- 
proach high-order statistics. There have already been 
some works in cosmology using copula. Sato et al. 2010 
(a)&(b) ^]l32] use Gaussian Copula and one-point 
PDF to describe the power spectrum estimation. They 
found that the likelihood derived in this way is more sim- 
ilar to the likelihood directly derived from simulation, 
than Gaussian likelihood hypothesis usually made in cos- 
mological parameter estimation pipeline. Hence the pa- 
rameter constrain is more accurate and the degeneracy 
is expected to be broken. 
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